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Abstract 

When two independent analog signals, X and Y are added together giving Z = X + Y, 
the entropy of Z, H(Z), is not a simple function of the entropies H{X) and H(Y), but 
rather depends on the details of X and Y's distributions. Nevertheless, the entropy power 
inequality (EPI), which states that e 2H<yZ ^ > e 2H ^ +e 2H ^ Y \ gives a very tight restriction 
on the entropy of Z. This inequality has found many applications in information theory 
and statistics. The quantum analogue of adding two random variables is the combination 
of two independent bosonic modes at a beam splitter. The purpose of this work is to 
give a detailed outline of the proof of two separate generalizations of the entropy power 
inequality to the quantum regime. Our proofs are similar in spirit to standard classical 
proofs of the EPI, but some new quantities and ideas are needed in the quantum setting. 
Specifically, we find a new quantum de Bruijin identity relating entropy production under 
diffusion to a divergence-based quantum Fisher information. Furthermore, this Fisher 
information exhibits certain convexity properties in the context of beam splitters. 
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A The Q-function: Definition and Properties [27 

1 Introduction 

1.1 The classical entropy power inequality: formulation 

In his 1948 paper |Sha48] . Shannon identified the (differential) entropy 



H(X) = - J f x {x)\ogf x {x)dx 



of a random variable X with support on W 1 as the relevant measure for its information content. 
Using differential entropies, Shannon obtained an expression for the capacity of a noisy channel 
involving continuous signals. As a corollary, he explicitly computed the capacity of the white 
thermal noise channel. These are among his most well-known contributions to continous- 
variable classical information theory. However, Shannon also recognized another fundamental 
property of differential entropy called the entropy power inequality (EPI). 

Consider a situation where an input signal (or random variable) X is combined with an 
independent signal Y, resulting in an output X + Y with probability density function given by 
the convolution 

fx + v(z) = J fx(x)fy(z - x)d n x for all z G R n . (1) 
We may think of the operation 

(fx,f Y )^fx+Y (2) 

as an 'addition law' on the space of probability density functions. Shannon's entropy power 
inequality relates the entropies of the input- and output variables in this process. It is commonly 
stated as 

e 2H{X+Y)/n y e 2H(X)/n + g 2H(y)/n _ ^ 

It should be stressed that inequality ([3]) holds without assumptions on the particular form of 
the probability density functions fx, fy beyond certain regularity conditions. Specialized to 
the case where one of the arguments in (|2J), say, fy, is a fixed probability density function 
(e.g., a Gaussian), inequality (EJ) immediately gives a lower bound on the output entropy of the 
so-called additive noise channel X i— )■ X + Y (e.g., the white thermal noise channel) in terms 
of the entropy H(X) of the input signal. Given this fact and the fundamental nature of the 
operation (J2J), it is is hardly suprising that the EPI ([3]) has figured prominently in an array of 
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information-theoretic works since its introduction in 1948. For example, it has been used to 
obtain converses for the capacity of the Gaussian broadcast channel, and to show convergence 
in relative entropy for the central limit theorem |Bar86j . Other uses of the entropy power 
inequality in multi-terminal information theory may be found e.g., in |GK02j . 

The expression entropy power stems from the fact that a centered normal (or Gaussian) dis- 
tribution fx(x) = ( 27RJ 2)n/2 e~" x " /( 2cr > on R n with variance a 2 has entropy H(X) = ~ \og2nea 2 . 

Hence the quantity J-e M ^'" is the variance of X, commonly referred to as its power or 
average energy. For a general random variable Z, the quantity j—e 2H ( z ^ n is the power of a 
Gaussian variable with the same entropy as Z. This fact has been connected |CC84] to Brunn- 
Minkowski-type inequalities bounding the volume of the set-sum of two sets in terms of their 
spherical counterparts, which in turn inspired generalizations to free probability theory [SV96] . 
In the following, we will omit the factor and refer to e 2H ( z >' n as the entropy power of Z. 

The entropy power inequality ([3]) can be reeexpressed as a kind of 'concativity' property of 
the entropy power under a slightly modified 'addition rule' (X, Y) i— y X EB1/2 Y. To define the 
latter, let fiX denote the random variable with probability density / M x(^) — -/x(~)> ^ £R" 
for any scalar /i > 0. We define X EH 1/2 Y as the result of rescaling both X and Y by l/y/2 
and subsequently taking their convolution (see Eq. (j3J) below). Because differential entropies 
satisfy the scaling property H(pX) = H(X) + nlog/i for /x > 0, this leads to the following 
reformulation of the entropy power inequality (j3J): 

e 2H(Xm 1/2 Y)/n > \ e 2H{X)/n + \ e 2H{Y)/n _ ( C EPI) 

Observe that the rhs. is the equal-weight average of the entropy powers of X and Y . 

It turns out that the entropy H(X) itself satisfies a concativity inequality analogous to (IcEPIl) 
(see (IcEPI'l) below). More generally, introduce a parameter < A < 1 controlling the relative 
weight of X and Y and consider the addition rule 

UxJy)-* fxm x v where fxsi\Y = f \/\x+\/T—xy ■ ( 4 ) 

Note that this map is covariance-preserving when applied to two random-variables X and Y 
whose covariances coincide, a property not shared by convolution without rescaling (i.e., (j2])). 
The concativity inequality for the entropy takes the form 

H(X ffl A Y) > XH(X) + (1 - X)H(Y) for < A < 1 . (cEPf) 

The set of inequalities (jcEPI'l) can be shown |Lie78[ IVG06j to be equivalent to OcEPIj) . We will 
refer to both (IcEPIl) . flcEPFI) as entropy power inequalities; both express a type of concativity 
under the addition law (HI). 



1.2 The quantum entropy power inequality: formulation 

Here we formulate and subsequently outline a proof of a quantum version of the entropy power 
inequality. It is phrased in terms of the von Neumann entropy 

S(X) = -tr(p x logpx) 

of a state px on a separable infinite-dimensional Hilbert space describing n bosonic degrees of 
freedom. In quantum optics terminology, it gives a relation between the entropies of states going 
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in and coming out of a beamsplitter. Central to this program is the identification of the correct 
definitions applying to the quantum case. This is what we consider our main contribution. 

Our 'quantum addition law' generalizes (Jl]) and is most conveniently described for the case 
of two single-mode (n = 1) states px and py (although the generalization to n > 1 modes 
is straightforward, see Section 12.21) . It takes the following form: the Gaussian unitary U\ 
corresponding to a beam-splitter with transmissivity < A < 1 is applied to the two input 
modes. This is followed by tracing out the second mode. Formally, we have (see Section 12.21 
for more precise definitions) 



(Px,Py) >-» Pxm x Y = £\{px ® Py) where S x {p) = tr 2 UxpU{ 



x ■ 



(5) 



This choice of 'addition law' is motivated by the fact that the resulting position- and momentum 
operators (Q, P) of the output mode in the Heisenberg picture are given by 



Q =V\Q 1 + VT^XQ 2 

p =Vxp 1 + VT^xp 2 



(6) 



in terms of the original operators of the first (Qi,Pi) and second {Q21P2) mode. Hence ([6]) 
indeed mimics the classical addition rule (jl]) in phase space. 



Having defined an addition law, we can state the generalizations of (IcEPIj) and ( jcEPI'j) to 
the quantum setting. They are 



^S(XSi 1/2 Y)/n 



> 1 S(X)/n , 1 5(V 
2 2 

S(X ffl A Y) > \S(X) + (1 - X)S{Y) 



for A E [0, 1] . 



(qEPI) 
(qEPI') 



Observe that ( qEPI ) differs from (IcEPIj) by the absence of a factor 2 in the exponents. This 
may be attributed to the fact that the phase space of n bosonic modes is 2n-dimensional. We 



emphasize, however, that neither ( qEPI ) nor ( qEPI 7 ) appear to follow in a straightforward 
manner from their classical counterparts. A further distinction from the classical case is that, 



to the best of our knowledge, Eq. ( qEPI ) and (qEPT) do not appear to be equivalent. 

We will not discuss implications of these inequalities here (but see |KS12] for an upper 
bound on classical capacities), but expect them to be widely applicable. We point out that the 
importance of a quantum analog of the entropy power inequality in the context of additive noise 
channels was recognized in ealier work by Guha et al. |GES07j . These authors proposed a dif- 
ferent generalization of ( IcEPIj) . motivated by the fact that the average photon number tr (a^ap) 
of a state p is the natural analog of the variance of a (centered) probability distribution. In 
analogy to the definition of (classical) entropy power, they defined the photon number of a 
state p as the average photon number of a Gaussian state o with identical entropy. Their 
conjectured photon number inequality states that this quantity is concave under the addition 
rule (X,Y) h-> X EEU Y. We find that our more literal generalization (qEPI) (for A = 1/2) 



allows us to translate a proof of the classical entropy power inequality to the quantum setting. 
The formulation (qEPI) also yields tight bounds on classical capacities |KS12j for thermal noise 
associated with the transmissivity-l/2-beamsplitter. We conjecture that the quantity e s( - x ^ n 
is concave with respect to the addition rule EBa for all < A < 1 (a fact which is trivial in the 
classical case due to the scaling property of entropy). If shown to be true, this establishes tight 
bounds on the classical capacity of thermal noise channels, for all transmissivities. 
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1.3 The classical entropy power inequality: proof sketch 



It is instructive to review the basic elements of the proof of the classical entropy power inequal- 
ity. Roughly, there are two different proof methods: the first one relies on Young's inequality in 
the version given by Beckner |Bec75j . and Brascamp and Lieb [BL76] . The second one relies on 
the relation between entropy and Fisher information, known as de Bruijin's identity. For a de- 
tailed account of the general form of these two approaches, we refer to |DCT91j . While |DCT 91j 
gives a good overview, it is worth mentioning that more recently, Rioul has found proofs of the 
EPI that manage to rely entirely on the use of 'standard' information measures such as mutual 
information [Rio06t IRiollj . His work also involves ideas from the second approach. 

Here we focus on the second type of proof, originally introduced by Stam [Sta59] and further 
simplified and made more rigorous by Blachman [Bla65j and others [Bar86t IDCT91] . Its main 
components can be summarized as follows: 

(i) Gaussian perturbations: For a random variable X on W 1 define 

X t = X + VtZ t>0 (7) 

where Z be a standard normal with variance a 2 = 1. It is well-known that the family of 
distributions f t = fx t satisfies the heat or diffusion equation 

F) N f) 2 

s /, = A/„ A^JJ (8) 

j=i j 

with initial condition fo(x) = fx(x) for all x G Mr. In particular, diffusion acts as a 
one-parameter semigroup on the set of probability density functions, i.e., 

X h+t2 = (X tl ) t2 for all h,h>0. (9) 



Time evolution under (JSJ) smoothes out initial spatial inhomogenities. In the limit t — > oo, 
the distribution of X t becomes 'simpler': it approaches a Gaussian, and the asymptotic 
scaling of its entropy H(X t ) ~ g(t) is independent of the initial distribution f X - This is 
crucial in the proof of the EPI. 

A second essential feature of the evolution (jSJ) is that it commutes with convolution: we 
have (for any < A < 1 and t > 0) 

X t ffl A Y t = (X ffl A Y) t (10) 

for the time-evolved versions (X t ,Y t ) of two independent random variables (X,Y). 

de Bruijin's identity: A third property of the diffusion process (JTJ) concerns the rate 
of entropy increase for infinitesimally small times, t — y 0. de Bruijin's identity relates this 
rate to a well-known estimation-theoretic quantity. It states that 



d 
dt 



H(X + ViZ) = -J(X) where J(X) = [ (V f(x)f '(V f(x))d N x . (11) 
t=o 2 J 
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The quantity J(X) is the Fisher information J(X) = J(f^;9)\g=o associated with the 
family {f^}eeR of distributions 

=f(x-6-e) forallzGR" where e = (l,...,l) . (12) 

In other words, the random variables {X^}g are obtained by translating X by an un- 
known amount 9 in each direction, and J(X) relates to the problem of estimating 9 (by 
the Cramer- Rao bound). More importantly, the quantity J(X) inherits various general 
properties of Fisher information, generally defined as 

J(f (e) ;0)\ e=6o = J (^f {9) (x)J f^\x)dx\ e=eo (13) 

for a parameterized family {/^} of probability distributions on R (this is replaced by a 
matrix in the case n > 1). 

The link between entropy and Fisher information expressed by de Bruijin's identity is 
appealing in itself. Its usefulness in the proof of the EPI stems from yet another com- 
patibility property with convolution. Translations as defined by ( II 2p can be equivalently 
applied before or after applying addition (convolution), that is, we have 

(Xm x Y) {e) = x (VIe) ffl A F (v/T3Ie) for < A < 1 and 9 G R . (14) 

In other words, adding (convolving) X and Y and then translating by 9 is equivalent to 
adding up translated versions X^ 6 *), 

Convexity inequality for Fisher information: This shows that Fisher information 
behaves in a 'convex' manner under the addition law 01]), i.e., 

J(Xm x Y) < XJ(X) + (1-X)J(Y) . (15) 

A related inequality for the addition law when A = 1/2 is known as Stam's inequality; it 
takes the form 

? 11 

>—— + ——. (16) 



J(X ffl 1/2 Y) ~ J(X) JiY) 

Proofs of such inequalities have been given in |Sta59t IBla65t ICC84] . An insightful proof 
was provided by Zamir |Zam98] . relying only on basic properties of Fisher information. 
The most fundamental of those is the information-processing inequality, which states 
that Fisher information is non- decreasing under the application of a channel (stochastic 
map) £, 

J(S(f%9)\ e ^<J(f^9)\ e ^. (17) 



The Fisher information inequalities ( [151) and ( TlBj) are simple consequences of this data- 
processing inequality and the compatibility (114j) of convolution with translations. For 
example, the quantities on the two sides of (Tl5l) are the Fisher informations of the two 
families of distributions {X^}g and {(X^^ e \ Y^ l ~ xe ^)}e, and are directly associated 
with both sides of the compatiblity identity (JTJ 
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The entropy inequalities flcEPIj) and (IcEPI'l) follow immediately from (i)-(iii). Concretely, let 



us describe the argument of Blachman [Bla65j adapted to the proof of flcEPI'|) (The derivation 
of flcEPip is identical in spirit, but slightly more involved. It uses ([To]) instead of (|15p .). Note 



that a similar proof which avoids the use of asymptotically divergent quantities is provided 
in |DCT91] , yet we find that the following version is more amenable to a quantum generalization. 
The core idea is to consider perturbed or diffused versions 

X t = X + Vtz l 
Y t = Y + ViZ 2 , 

of the orignal random variables (X, Y), where t > and where Zx, Z 2 are independent random 
variables with standard normal distribution. This is useful because in the infinite-diffusion 
limit t oo, the entropy power inequality is satisfied trivially for the pair (X t ,Y t ) by (i). 
Furthermore, de Bruijin's identity (ii) and the Fisher information inequality (Hi) provide an 
understanding of the involved entropies in the opposite limit, where the diffusion only acts 
for infinitesimally short times. This allows one to extend the result from infinitely large times 
down to all values of t including 0. In the latter case, there is no diffusion and we are dealing 
with the original random variables (X, Y). 



Sketch proof of flcEPl 7 ]) . In more detail, introduce the quantity 

5(t) := H(X t ffl A Y t ) - XH(X t ) - (1 - X)H(Y t ) (18) 

measuring the amount of potential violation of the entropy power inequality for the pair (X t , Y t ). 
The entropy power inequality (IcEPI'l) for (X t , Y t ) is equivalent to the statement 5(t) > 0. Since 
(X ,Y ) = (X, Y), the entropy power inequality (IcEPI'j) amounts to the inequality 

5(0) > . (19) 

To show (|T9|) . observe that in the infinite-diffusion limit t — > oo, we have 

lim 5(t) = (20) 

t— >oo 

as all entropies H(X t ) ~ H(Y t ) ~ H(X t S\Y t ) have the same asymptotic scaling. Here we used 
the compatibility f lTU]) of diffusion and the addition rule (jl]). de Bruijin's identity (i) and the 
convexity inequality for Fisher information (ii) imply that the function 5(t) is non-increasing, 
i.e, 

5(t) > for all t > . (21) 

Indeed, we can use the semigroup property ()9]) of diffusion to show that it is sufficient to 
consider infinitesimal perturbations, i.e., the derivative of 5(t) at t — 0. But the condition 

5(0) > 

is simply the Fisher information inequality ( TT5|) . thanks to de Bruijin's identity (jTTj) and the 
compatibility fllOp of diffusion with convolution. Hence one obtains (|2T|) which together with 
Eq. ([20]) implies the claim ([11]). □ 
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1.4 The quantum entropy power inequality: proof elements 

Here we argue that a careful choice of corresponding quantum-mechanical concepts guarantees 
that the remarkable compatibility properties between convolution, diffusion, translation, en- 
tropy and Fisher information all have analogous quantum formulations. This yields a proof of 



the quantum entropy power inequalities (qEPI) and (qEPT) that follows exactly the structure 



of the classical proof, yet involves fundamentally quantum-mechanical objects. 
In this section, we briefly motivate and discuss the choices required. 

(i) A quantum diffusion process: For a state p of n modes, we define 

Pt = e tc {p) t > (22) 

as the result of evolving for time t under the one-parameter semigroup {e tc } t >o generated 
by the Markovian Master equation 

- Pt = C(jh) £(•) = -4 E Wi' ^ + ft' 'ID • ^ 

This choice of Liouvillean C is motivated (as in |HalOO] ) by the fact that it is the natural 
quantum counterpart of the (classical) Laplacian (cf. dHJ) 



A (-) = j E fe> •» + fe> fe' •») 



4 

acting on probability density function on the symplectic phase space M 2n of n classical 
particles (the factor | is introduced here for mathematical convenience). Indeed, C is 
obtained from the Dirac correspondence {•, ■} (->■ •] between the Poisson bracket and 
the commutator, as well as phase space variables and canonical operators, {qj,Pj) (->■ 

Alternatively, one can argue that the Wigner functions {W t } of the states {p t } defined by 
the evolution (f22|) obey the heat equation (jSj) on IR 2 ". 

Hall [ Hal OO] has previously examined the relation between the Master equation (1231) and 
classical Fisher information. He argued that the position- and momentum density func- 
tions ft{x) = (x\p t \x) and f t (p) = (p\p t \p) obey the heat equation (jSJ). Correspondingly, 
he obtained de Bruijin-type identities for the measurement-entropies when measuring p t 
in the (overcomplete) position- and momentum eigenbases, respectively. These results 
are, however, insufficient for our purposes as they refer to purely classical objects (derived 
from quantum states). 

(ii) Divergence-based quantum Fisher information: The information processing in- 
equality (fl?l) is well- known to be the defining property of classical Fisher information: A 
theorem by Cencov |Cen81 ICam86j singles the Fisher information metric as the unique 



monotone metric (under stochastic maps) on the manifold of (parametrized) probability 
density functions. Unfortunately, such a characterization does not hold in the quantum 
setting: Petz [Pet96j established a one-to-one correspondence between monotone metrics 
for quantum states and the set of matrix-monotone functions. 
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It turns out that the correct choice of quantum Fisher information in this context is given 
by the second derivative of the relative entropy or divergence S(p\\cr) = tr(p log p— p log a), 
that is, 

d 2 



J(p {0) ;0)„ . =^(p w IIp w ). . • (24) 



6=6 d9 2 e=e 
This choice is guided by the fact that the classical Fisher information (fT3l) can also be 
written in this form, that is, J(/W; 9) \ e=f)o = ^D(f^\\f^) ^ ^ , where D(f\\g) = 

J f(x) log^M is the classical relative entropy We call the quantity ( 124"]) the divergence- 
based (quantum) Fisher information. 

Lesniewski and Ruskai |LR99j have studied metrics obtained by differentiation of a 'con- 
trast functional' 5'(-||-) in a similar manner as (1241) . They found that every monotone 
Fisher information arises from a quasi-entropy in this way In particular, the data- 
processing inequality ffl7|) is a consequence of their general result. We will discuss an 
independent elementary proof of the data processing inequality for the divergence-based 
Fisher information in Section HI 

(iii) Translations of quantum states: These are simply translations in phase space. We 
use translated versions of a state p displaced in each direction, 

p(0,Qj) = e i8Pj pe -* eP J 

p (e,Pj) = e -ie Qjpe io Qj foTj = l,...,n. 
Clearly, this choice is dictated by the Heisenberg action on the mode operators, 

<■'"'■(),< ""' ■ 

e~ ieQi Pje ieQi = Pj + 6 , 

in analogy to the addition law (j6]). 
In the remainder of this paper, we verify that these definitions indeed satisfy the necessary 



properties to provide a proof of the quantum inequalities ( qEPI ) and (qEPT). A word of 
mathematical caution is in order here: several times we will need to interchange a derivative with 
an integral or infinite sum, typically taking a derivative inside a trace. This can be justified as 
long as the functions involved are sufficiently smooth. In order to focus on the main ideas in our 
argument, rather than pursue a detailed justification of the interchange of limits we will simply 
restrict our attention to families of states that are sufficiently smooth. This involves little loss 
of generality since we are interested in the entropy of states satisfying an energy bound. On this 
set of states, entropy is continuous, so one can hope to approximate non-smooth functions with 
smooth ones to obtain the unrestricted resu 110. As a result, smoothness requirements are not 
usually considered in proofs of the classical EPI |Sta59[ lBla65t IRio06l IVG06] or studies of related 
quantum ideas [GGL+041 IGHLMlOj . Developing a more mathematically thorough version of 
these arguments appears to be a formidable task (even in the classical case, see (Bar86j) which 
is left to future work. 



1 Alternatively, one could introduce a high-photon- number cutoff (well higher than the energy bound of the 
states under consideration) to make the state space finite, where these problems disappear, and take the limit 
as the cutoff grows. 
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2 Basic definitions 



2.1 Continuous-variable quantum information 

An quantum state of n modes has 2n canonical degrees of freedom. Let Qk and Pk be the 
"position" and "momentum" operators of the fc-th mode, for k — 1, . . . , n, acting on the Hilbert 
space H n associated with the system. Defining R = (Q 1 , P 1 , . . . , Q n , P n ), these operators obey 
the commutation relations 

[R k , R e ] = iJ M where J = 

The Weyl displacement operators are defined by 

= e * JR for (GM 2n . 
These are unitary operators satisfying the relations 

D(Z)D(r)) = e-& Jr >D(Z + r t ) 

and 

D(OR k D(O j = Rk + &I or (25) 
[D(0,Rk]=ZkD(0 ■ (26) 

This explains the terminology. Any state p takes the form 

P=(2n)- n j xM)D{-Od 2n t , (27) 

where 



*, a 



Xp(0 = tr(D(Op) (28) 

is the characteristic function of p. The characteristic function x P is well-defined for any Hilbert- 
Schmidt class operator p. The map p h-> ^ p is an isometry between the Hilbert space of these 
operators and the space L 2 (R 2n ) of square- integrable functions on the phase space: 

tr(pV) = (2tt)-" J 3S0x«r(0 for all p, a . (29) 

Observe that for Hermitian p = p* , we have Xp(0 — since -D(^)"!" = -D(— £) 

A Gaussian state p is uniquely characterized by the vector d = (di, . . . , rf 2 n) £ K 2n of first 
moments 



4 = tr(pi4) (30) 

and its covariance matrix 7 with entries 

'jki = tr (p{R k - d k , R e - di}) , (31) 
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where {A, B} = AB + BA. Its characteristic function is 

X(0 = exp(i£- Jd-e- • (32) 

The covariance matrix 7 of a quantum state satisfies the operator inequality 

7 > «V , (33) 

which is Heisenberg's uncertainty relation. Conversely, any matrix 7 satisfying ( 133|) and vector 
d G M 2n define a Gaussian quantum state via (l3"2l . Complex conjugating ( 1331 and adding it 
to itself shows that the covariance matrix 7 > is nonnegative. According to Williamson's 
theorem |Wil36] , there is a symplectic matrix S G Sp(2n, R), i.e., a matrix satisfying SJS T = J, 
such that 

S^S^ = diag(z/i, z/i, z/ 2 , z/ 2 , • • • , v n , v n ) with v 3 > (34) 

is diagonal. Eq. 031]) is called the Williamson normal form of 7, and {z^}./ are referred to as 
the Williamson eigenvalues. 

Symplectic transformations S are in one-to-one correspondence with linear transformations 
R i—)- B! := S*/? preserving the canonical commutation relations. Each S G Sp(2n, K) uniquely 
defines a unitary £/s on % n which realizes this map by conjugation, i.e., 

2n 

UlR k U s = Y, S ^Ri=-R'k • (35) 
£=1 

If S G Sp(2n, M) brings the covariance matrix of a Gaussian state p into Williamson normal 
form as in (1341 . then {-R^j-fc are the eigenmodes of p. In terms of the creation, annihilation and 
number operators 



a 



t 1 I 



nk = a k a,k j for /c = 1, . . . , n 

the state takes the form 

tr e~@ knk 



it=i 



In this expression, the inverse temperatures (3k are defined in terms of the symplectic eigen- 
values as 

(eft. - l)- 1 = [v k - l)/2 := JV(i/ fc ) for fc = 1, . . . , n . (37) 

The quantity ( 137|) is the mean photon number, i.e., the expectation value tr(Lfsp£/ga|.afc). Since 
the number operators have the spectral decomposition n& = Yl n >o n \ n ) ( n \ with integer eigenval- 
ues, the entropy S(p) = — tr(plog p) of a Gaussian state p can be evaluated from and ( 157)) 

as 

n 

S , (/9) = 5^^(JV(i/ fc )) where ^(A^) := (N + 1) log (AT + 1) — iVlogiV . (38) 
k=i 
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Note that this quantity does not depend on the displacement vector d. 

A CPTPM £ on H is called Gaussian if it preserves the set of Gaussian states, that is, 
the state p' := £(p) is Gaussian for all Gaussian input states p. A Gaussian operation £ is 
completely specified by its (Heisenberg) action on mode operators. The action of £ on Gaussian 
states is determined by a triple (X, Y, £), where £ G M. n is an arbitrary vector and X, Y are real 
2n x 2n matrices satisfying Y T = Y and Y + i(J — XJX T ) > 0. For a Gaussian state p with 
covariance matrix 7 and displacement vector d, the Gaussian state £(p) is described by 

7 ^ y : = X 7 X T + F (39) 
d M- d' : = d + £ . (40) 

More generally, the action on a general state p is determined by the action 

X P {0 X£(p)(0 = Xp(^Oe-^' n • (41) 

on characteristic functions. Eq. ( 141)) implies that the set of Gaussian operations is closed 
under composition. In the special case where £(p) = UpW is unitary, we have Y = and 
X =: S G Sp(2n, R). We call such a unitary [/ Gaussian and sometimes write U = Us to 
indicate it is defined by S (cf. ( 1351) ). 

A general Gaussian operation £ on H n has a Stinespring dilation with a Gaussian unitary 
S G Sp(2(n + m), M.) acting on the Hilbert space % n <g> "H m of n + m modes (for some m) and 
a Gaussian ancillary state ps on "H m , i.e., it is of the form 

£(p)=tY B (U s (p®p B )Ul) . (42) 

Since the operations of taking the tensor product and the partial trace correspond to taking the 
direct sum or a submatrix on the level of covariance matrices, identities (J39l) and (1421) translate 
into 

[S( 1 ® lB )S T ] 2n = X 1 X T + Y , 

where [-^n denotes the leading principle 2n x 2n submatrix and 7# is the covariance matrix 
of pb- It is again convenient to express this in terms of characteristic functions. First consider 
the partial trace: if p n+m has characteristic function Xp n +m(£ ■>£')■> where £ G M 2n , £' G R 2m , 
then the partial trace p n = tr m p n + m has characteristic function 

With (|4T|) . we get the transformation rule 

x P ^ xe( P m ■■= fix ® x B )(5(£,n)e^ { ^' ) - y{ ^ ,) rf 2m r , (43) 

where x_b is the characteristic function of the Gaussian ancillary state p B . Here (x®X-b)(£> CO : = 
x(0 ' Xb(£,') is the characteristic function of the product state p® p B . 
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2.2 Quantum addition using the beamsplitter: definition 



In this section, we specify the quantum addition operation (jSJ) in more detail. It takes takes 
two n-mode states px, Py and outputs an n-mode state pxm x Y = £\(px® Py), where < A < 1. 

To define the CPTP map E\, consider the transmissivity A-beam splitter. This is a Gaussian 
unitary U\, n , whose action on 2n modes is defined by the symplectic matrix 

S„ n =( fi^L ^^Wa, (44) 



Vl - A n -vAI„ 

where we use a tensor product {e^ , . . . , ejf , ef , . . . , ef } ® {q,p} to represent the two sets of 
modes (q* ,pf , . . . , q* ,P X ) an d (^fT' • • • > QniPn)- Note that the symplectic matrix takes the 
form /„ £g> J, with 



J 



1 
-1 



With respect to the n pairs (qf } pf),(qj ,qY) of modes, for j = l,...,n, we have 5 



A, n 



Sf™, hence Ux, n — ^xi corresponds to independently applying single-mode beam-splitters 
to each pair of modes. In the following, we will omit the subscript n, and simply write 

= ■■■,&,&) and £ y = (f? 1 , • • • , Z? n ,g n ) for the collection of phase space 

variables associated with first and second set of n modes. 

The map S\ is defined by conjugation with U\ and tracing out the second set of modes, i.e., 
it is 

£x(pxy) =ti Y U x p XY U{ . (45) 

Clearly, this is a Gaussian map. A bipartite Gaussian state pxy with covariance matrix and 
displacement vector 

7= (? X l XY ) , d = (d x ,d Y ) (46) 



K lYX JY _ 

gets mapped into a Gaussian state £\(pxy) with 

i = A7x + (1 - A)7y + a/ A(1 - A )(7 X y + 7yx) / 4? n 



This completely determines the action of E\ on Gaussian inputs, but we will also be interested 
in more general inputs of product form. To get an explicit expression, consider two states px, Py 
with characteristic functions xx, Xy- According to (jUJ) and (1441) . the state U\{p x ® Py)U^ has 
characteristic function 

£y) = Xx(VAfr + VT^Xfr) • Xy(VT^\£x - V\fr) ■ (48) 
It follows that £\{p ® p) has characteristic function 

Xs x ( Px ®p Y )(0 = f Xx(VXC + VT^X^y) ■ Xy(VT^\£ - V^yRy . (49) 
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2d 



3 A quantum diffusion equation 

Consider the diffusion Liouvillean 

C ^ = -4 E (ft?* ft?* ■]] + t p i> fo> •]]) = "i E^> [^^ii • ( 5 °) 

defined on n modes (cf. fl23|) ). We first establish the relevant properties of the one-parameter 
semigroup {e tc }t>o, where the CPTP map e tc describes evolution under the Markovian master 
equation 

j t p(t) = C(p(t)) . (51) 

for time t. We call this the diffusion semigroup. We will subsequently show that the maps e tc 
are compatible with beamsplitters (Section l3.ip . and analyze the asymptotic scaling S(p(t)) of 
solutions pit) = e tc (p) of flED) (Section El). 

The following is well-known (see e.g., [LGM+09] . where C is given in terms of creation- and 
annihilation operators) and usually stated without proof. 

Lemma 3.1. Let C be the Liouvillean ( I577j) . Then 

(i) The Liouvillean C is Hermitian with respect to the Hilbert- Schmidt inner product, i.e., 

tv(p£(a)) = tr(£(p)a) (52) 

for all states p, a. 

(ii) Let po be state with characteristic function Xpo(0> an d ^ p(f) = e * £ (Po) denote the 
solution of the Master equation (|5ip with initial value p(0) = po- Then e* £ (po) has 
characteristic function 



X W (0 = Xpo(Oexp(-||e|| 2 t/4) , (53) 



where U\\ 2 = ET=itl 



'3=1 S J 

(Hi) For any t > 0, the CPTPM e tc is a Gaussian map acting on covariance matrices and 
displacement vectors by 

d M-d' = d. 1 } 

Proof. If Xp{£) is the characteristic function of a state p, we can express the commutator [Rj, p] 
as 

by using ( 1Z7|) and the displacement property (12T)|) . Iterating this argument gives 
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hence 



C( P ) = --^ J X P (0U\\ 2 D(-0d 2 C ■ (55) 

In other words, the operator C(p) has characteristic function Xc(p)(0 — — zXp(0 ll£l| 2 - Together 
with (1291) . this immediately implies ( 15^|) . 

Let p(t) be the operator with characteristic function (1531) . By commuting derivative and 
integration, we have 

= ~A(hy S x ^ma 2 eM-U\\ 2 t/^)D(-0d^ 

Combined with fl55|) . this shows that p(t) is indeed the solution to the Liouville equation, 
i.e., p{t) = e tc (p). This proves (ii). 

Finally, property (iii) follows from (ii) since multiplying a Gaussian characteristic function 
by a Gaussian preserves its Gaussianity. □ 

3.1 Compatibility with the beamsplitter 

In the same way as convolution is compatible with the heat equation (cf. (ITU]) ), beamsplitters 
are compatible with diffusion defined by the Liouvillean C. This can be understood as a 
consequence of the fact that, on the level of Wigner functions (i.e., the Fourier transform of the 
characteristic function), both the beam-splitter map £\ and the diffusion e tc are described by 
a convolution. Here we give a compact proof based on the fact that these maps are Gaussian. 

Lemma 3.2. Let tx,ty > 0, < A < 1 and let C be the diffusion Liouvillean acting on n 
modes. Then 

S x o ( e txC ® e wc ) = e tc o £ x where t = Xt x + (1 - X)t Y . (56) 

Proof. According to Sections 12.21 and |3l both S\ and e tc are Gaussian maps, hence this is 
true also for both sides of identity fl56|) . This means it suffices to verify that they agree on 
all Gaussian input states pxy described by ( H6"j) . The claim follows immediately from ( H7|) 
and (El]): Indeed, both 8\(e txC Cg> e tYC )(p X Y) and e tc (£ x (pxY)) are described by 

l' = A 7 x + (1 - A) 7 y + tl 
d' = y/\d x + Vl - Xd Y ■ 

□ 
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3.2 Scaling of the entropy as t — > oo 

The following result will be essential for the proof of the entropy power inequality. It derives 
from a combination of arguments from |GGL + 04] and |Hir06] , as well as the maximum entropy 
principle |WGG06j . 

Theorem 3.3 (Entropy scaling under diffusion). Let p be an arbitrary (not necessarily Gaus- 
sian) state, whose covariance matrix has symplectic eigenvalues v\,..., u n . Let t > 2. Then 

n 

ng(N(t-l)) < S(e tc (p)) < J2g(N(t + v k )) , (57) 

k=l 

where g(N) = (N + 1) log(iV + 1) — N log iV is the entropy of one-mode state with mean photon 
number N and N(u) — [y — l)/2. 

Proof. The lower bound is essentially the lower bound on the minimum output entropy from |GGL + 04 



(33)], generalized to several modes. We recall the necessary definitions in Appendix |A] Any 
state can be written in the form 

p = J Q^a^v with a( V ) = JL J e^e^D(-C)^ , 

where Q is a probability distribution, i.e., Q{rf) > and J Q(r])d 2n r] = 1. The operators a(rj) are 
generally not quantum states. However, by linearity of the superoperator e tc , we can compute 

according to Lemma [3~T| which shows that e tc (a(r])) is a displaced thermal state with covariance 
matrix (t — l)/2n if i > 2. Because entropy is concave, we therefore get 



S(e^(p)) > J S(e^(a( V )))d^ V = ng(N(t - 1)) , 

and the claim follows. 

For the proof of the upper bound in (jSTj) . we can assume without loss of generality that p is 
Gaussian. This is because the maximum entropy principle [WGG06] states that for any state p, 
we have 

S(p) < S(p G ) , (58) 

where p G is a Gaussian state with covariance matrix identical to p. The remainder of the proof 
closely follows arguments in [Hir06]. It is a well-known fact that the entropy of a state p can 
be computed by taking the limit 

Sip) = lim ^^logtr// . (59) 
p->i+ 1 — p 

For a Gaussian state p whose covariance matrix has symplectic eigenvalues v = (u%, . . . , u n ), 
the expression on the rhs. is equal to [HSHOOJ 

tr P" = II 777T := where = + 1) P " - 1) P (60) 

k=1 Jp\ u j) r p\ u ) 



16 



for any p > 1. Weak submajorization, written x -< w y for two vectors x,y G M n , is defined by 
the condition 

k k 

x -< w y <^ — X^^i for fc = 1, . . . ,n , 

3=1 3=1 

where x\ > x\ > ■ ■ ■ > x^ and y\ > yjj? — ' ' * ^ Vn are decreasing rearrangements of x and 
y, respectively. In [Hir06j, it is argued that the function v i-> respects the partial order 

imposed by weak submajorization on MJ 1 , i.e., it has the property 

F p (x)>F p (y) if x< w y. (61) 

Furthermore, [Hir06] Theorem 1] states that for any 2n x 2n real positive symmetric matrices A 
and B, the vector of symplectic eigenvalues of their sum A + B is submajorized by the sum of 
the corresponding vectors of A and B, respectively, i.e., 

v{A + B) ^ w (u(A) + v{B)) . (62) 

Combining ( 159|) . (JBPj) . fIBT]) and (jB2J) gives the following statement. Let ^[7] denote the centered 
Gaussian state with covariance matrix 7. Let 7^ > and 7^ > be positive covariance 
matrices with symplectic eigenvalues u A , v B . Then 



S{pYia + 1b\) <S\p 



.i=i 



(63) 



This statement also applies in the case where 7^ > has vanishing symplectic eigenvalues; 
this follows by applying the same argument to Ya := 7a + el and letting e — > + . The upper 
bound f l57|) follows immediately from fl58|) and fl63|) applied to 7,4 = 7 and 7^ = t/2 n (a valid 
covariance matrix for t > 1). This is because for an initial state p with covariance matrix 7, 
the time-evolved state e (p) has covariance matrix 7 + £l 2n according to Lemma [3. II □ 

Using Theorem 13.31 we can show that the asymptotic scaling of the entropy (or the entropy 
power) is independent of the initial state p, in the following sense. 

Corollary 3.4. The entropy and the entropy power of e (p) grow logarithmically respectively 
linearly as t — > 00, with an asymptotic time- dependence independent of the initial state p. More 
precisely, we have 



\S(e tL (p))/n - (1 - log2 + log t) | < 0(l/t) 
,1 
'7 



1 e 5(e^(p))/n_ £, < 0(1/t) 



where the constants in O(-) depend on p. 
Proof. From the Taylor series expansions 



log(l + e) = e + 0(e 2 ) 
(l + e ) 1 A = e _^ e + ( e 2 ) 
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we get 

g(N) = \ogN + 1 + 0(1/N) 

e»W = (I + N) e + O(i) for iV -> oo . 

Replacing the rhs of (157)1 with the upper bound $^fc = i<7(-W(£ + i>%)) < nmax k g(N(t + z/ fc )) =: 
ng(N(t + z/*)) and using N(u) = {i> — l)/2 therefore gives 

log(f - 1) + 1 - 0(l/t) <S(e tc (p))/n < log(*+^) + 1 + 0(l/t) 



f e - |e - O(i) ' < e s ^)' n < ° |e + f e + O(f) ' for t ^ oo . 



for some constant z/* depending on p. The claim follows from this. □ 

A simple consequence of Corollary 13.41 is that e tc (p) converges in relative entropy to a 
Gaussian state: we have 

\im S(e tc (p)\\e tc (pf) = , (64) 

t— >oo 

where a G denotes the 'Gaussification' of cr, i.e., the Gaussian state with identical first and second 
moments. Indeed, because e tc is a Gaussian map (Lemma 13. ip . we have e tc (p) G = e tc (p G ) for 
any state p. Therefore, 

S(e tc (p)\\e tc (pf) = S(e tc (p)\\e tc (p G )) 

= -S(e tc (p)) - tr(e tc (p)loge tc (p G )) 
= -S(e tc (p)) -tr(e tc (pf\oge tc (p G )) 
= S(e tc (p)) -tr(e tc (p G )\oge tc (p G )) 
= -S(e tc (p)) + S(e tc (p G )) . 

In the third identity, we used the fact that \oge tc (p G ) is quadratic in the mode operators. 
Statement (|64l) now follows from Corollary 13. 4[ 



4 Divergence-based Quantum Fisher information 

In this section, we introduce the divergence-based Fisher information and establish its main 
properties. We restrict our focus to what is needed for the proof of the entropy power inequality. 
We refer to the literature for substantially more general results concerning quasi-entropies and 
quantum Fisher information (see e.g., |Pet961 [LR991 IPG101 [HMPBllj l 

Recall that the relative entropy or divergence of two (invertible) density operators p, a is 
defined as S(p\\cr) = trp(logp — logo"). The divergence is nonnegative and faithful, i.e., 

S(p\\o~) > and S(p\\cr) = if and only if p = a . (65) 

Furthermore, it is monotone, i.e., non- increasing under the application of a CPTPM S, 

S(S(p)\\S(a)) < S(p\\a) . (66) 
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Properties fl65|) and f )66|) tell us that 5'(-||-) may be thought of as a measure of closeness. A 
third important property of divergence is its additivity under tensor products, 



S(pi <8> P2IIC1 <8> cr 2 ) = S(pi\\ai) + S(p 2 \\a 2 ) 



(67) 



Consider a smooth one-parameter family h-> p( e ) of states. Using divergence to quantify 
how much these states change as we deviate from a basepoint #0 G M, it is natural to consider 
the function 9 1— > S(p^\\p^) in the neighborhood of 9 . In the following, we assume that it is 
twice different iable at #0 (but see comment below). According to ( 165]) . this function vanishes 
for 9 = 6q, and is nonnegative everywhere. This implies that its first derivative vanishes, i.e., 



d_ 

d9 



s(p {eo) \\p {e) : 







(68) 



One is therefore led to consider the second derivative, which we will denote by 

d 2 



Ap (8) 



1 a )\e=e Q 



d9 2 



S(pW\\pV>] 



=00 



(69) 



We call the quantity ( 1691) the divergence-based Fisher information of the family {p^}e- Its 
properties £11*6 cLS follows: 

Lemma 4.1 (Reparametrization formula). For any constant c, we have 

J(p^;9)\ e=eQ = C 2 J(p^;9)\ e= e 



J(p^ c) ;9)\ e=eo = J(p^;u )le=eo+c 
Proof. This is immediate from Definition ( 1691) . 
Lemma 4.2 (Non- negativity). The Fisher information satisfies 



Proof. Define f 0o (9) = S(p^ \\p^). We know that 

fe o (0)>0 for all 6 and f 0o (9 o )=O. 
The claim follows from this by writing the second derivative as a limit 



□ 



(70) 



d- 
d6 



2 S( P ^\\pW] 



de 2 



feoV) 



lim 

€->0 



2/0o (0o ) + f 60(60 



Lemma 4.3 (Additivity). For a family {p^ <8> Pb'}b of bipartite product states, we have 



J(p 



(0) 



--0o 



J(p 



e=e 



+ J(p ( b ) ;6) 



e=e 



Proof. This directly follows from the additivity fIBTl) of divergence. 



(71) 
□ 

(72) 
□ 
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Most importantly, the divergence-based Fisher information satisfies the data processing 
inequality. 

Theorem 4.4 (Data processing for Fisher information). Let £ be an arbitrary CPTPM. Then 

m P ^e)\ e ^<j{ P ^-e)\ e ^. 

Proof. The proof proceeds in the same way as a well-known proof for the information-processing 
property of classical Fisher information (see e.g., Ragllj ). We show here that our assumption 
that fe o (0) = S(p^\\p^) is twice differentiable at 9 = 9 is sufficient to give the desired 
inequality. From ( 1681) and the definition of Fisher information, we conclude that 



0=6>o 



9 ) 2 + o(\9-9 



(73) 



Let us argue that an analogous identity holds for the family {£(p( e ))}<j. Define ge (9) 
S{£{p 6 °)\\£{p e )). We know from ([65]), §k> and the monotonicity fl} that 



< ge Q (0) < fe Q (0) for all 9 and 

Therefore, the derivative of gg is 

d ,„,, ,. QoJ9^ + e) 



d9 



geM 



= hm 

0-00 e ^o 



e (0o) - fe (9 ) - -^fe Q {9) 



lim^ O + £) >0 



(74) 



On the other hand, ( 1741) also gives 



d 

~dl 



(a\\ v 9e {9o + e) 
9e n \y)\a a — hm— < hm 



! o + e) 



d9 



feM 



hence 



d_ 



. 



0=0o 



Similarly, writing the second derivative of ge as a limit as in ( ffTT) and using ( 1741) gives 

d 2 



< W JeM 



0=0o 



In summary, the function gg has vanishing first and bounded second derivative, which shows 
that 



1 



S{£{p<M)\\£{pW)) = -J(£(p {9) ); 9) -(9- 9 ) 2 + o(\9- 9 t 



0=00 



(75) 



The claim of the theorem now follows from (1731 . (I75|) and the data processing inequality ([66 



□ 
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One may worry about the differentiability of the function 9 i— > S(pg \\pg) for an arbitrary 
smooth one-parameter family {p^'}g on an infinite-dimensional Hilbert space. However, we do 
not need this full generality here. Throughout, we are only concerned with covariant families 
of states 

p {e) = e i{e-e ) Hp (e ) e -i{e-e )H for aU 9}9qER (76) 

generated by a Hamiltonian H. For a family of the form (1751) . we can easily compute the 
derivative 

^S(p m \\p ie) )\ 9=eQ = "tr (pW[iH,]ogpM]) , (77) 

In accordance with (1681) . this can be seen to vanish by inserting the spectral decomposition 
of p(flo). We can also give an explicit expression for the second derivative, the divergence-based 
Fisher information. It will be convenient to state this as a lemma. 

Lemma 4.5. Let {p^}e be a covariant family of states as in (175]) . Then 

A P {d) ;9)\ e=eo = tr(pW[H, [H,hgp^]}) . (78) 
Proof. Define £7(0) = e < e - e °^ H . Using £7(0)#£7(0)t = H, we calculate 

^logp w = [iH,U{0)Q°6P i0o) )U(ey] = U(6)[iH,logpM]U(6y , 



hence 



^logpW = [iH,U(P)[iH,*>6P™P(flV] . 



Therefore 



^S(p^\\p^) = -tr(pW^logpW) = -tr(p^ [iH,U(e)[iH,\ogp^]U(ey]) . 
Evaluating this at 9 = 8 gives the claim. □ 



5 The quantum de Bruijin identity 



The quantum analog of the translation rule (Tl2j) is defined in terms of phase space translations: 
For R e {Qj, Pj} define the displacement operator in the direction R as 

f e MPj if R = O 

Dr(6) = I . Rn ^ (79) 

V ; \e-^i if R = P j . v ' 

For a state p, consider the family of translated states 

pQM = D R (9)pD R (6)l 9 e R (80) 
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and its Fisher information J(p^ e ' K ^; 6)\ g _ Q - With a slight abuse of terminology, we will call the 
quantity 



2n 



k=l 

obtained by summing over all phase space directions the Fisher information of p. The quantum 
version of de Bruijin's identity (fTT]) then reads as follows: 

Theorem 5.1 (Quantum de Bruijin). Let C be the Liouvillean fl50|) . The rate of entropy 
increase when evolving under the one-parameter semigroup {e tc } t >o from the initial state p is 
given by 

j t S(e Ct (p))\ t=Q = ~J(p) . (82) 
Proof. We use the expression (see e.g., |Spo78j ) 



d 



j t S(e tL (p))\ t=0 = -tT(C(p)\ogp) . (83) 

for the rate of entropy increase under a one-parameter semigroup e tc . Because C is Hermitian 
(Lemma l3.ip . this is equal to 

j t S(e t£ (p))\ t=0 = -tr(pC(\ogp)) , (84) 
The claim now follows from the definition ( 15 Op of the Liouvillean C and Lemma 14.51 □ 

6 Fisher information inequalities for the beamsplitter 

6.1 Compatibility of the beam splitter with translations 

We first establish a slight generalization of the compatibility condition (|14p between the addition 
rule defined by the CPTPM E\ (cf. (14"5]) ) and translations D R in phase space (cf. (TT9|) ) . 

Lemma 6.1. Let 9,wx,wy G K and consider the CPTP maps 

F{p) = £x ((D R (w x 9) <g) D R (w Y 6)) p (D R (w x 6) ® D R (w Y 9)f 
Q{p) = D R (w9)£\(p)D R {w9)^ where w = \f~\wx + Vl — Xwy 

Then T = Q. In particular, 

£x(p ( x xe ' R) ® p ( y Ym ) = (£x(Px ® Py)) {w9 ' R) 

for any two n-mode states px and py, where p t— > p( d > R } is defined by (1HU1) . 



Proof. Since both T and Q are Gaussian (as compositions of Gaussian operations), it suffices 
to show that they agree on Gaussian inputs. Note that p \- > D R {9)pD R {9)^ leaves the covari- 
ance matrix p invariant while adding d! = d + 9 ■ e R to the displacement vector, where e R is 
the standard basis vector in the indirection. This, together with (14TI) . immediately implies 
that J= = Q. □ 
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6.2 Stam inequality and convexity inequality for Fisher information 

The following Fisher information inequality for the quantity J(p) (cf. ( 1HTT) ) is a straightforward 
consequence of the compatibility of beam-splitters with translations, and the data processing 
inequality for Fisher information. The arguments used here are identical to those applied in 
Zamir's proof |Zam98] (described transparently in |KY08] for the special case of interest here) 
of Stam's inequality. The following statement implies a quantum version of the latter and a 
convexity inequality of the form (IT5|) . 

Theorem 6.2 (Quantum Fisher information inequality). Let wx,wy G K and < A < 1. Let 

PxiPy be two n-mode states. Then 



w 2 J{£ x {px® Py)) < w x J{p x ) +w Y J(p Y ) 



where w = V\wx + Vl — Awy . 

Proof. Let R G {Qj, Pj}]=i and consider the family of product states {p£ x6 <8> Py 9 ' }e and 

the family {(£\(px ® Py)) }e- By Lemma [6TTI the latter is obtained by data processing 
from the former. By the data processing inequality for Fisher information, this implies 



J ((Sx(px ® py)) [wm 



6»=0 



and by the properties of Fisher information, this is equivalent to 



W 2 j({£ X {p X ®py)t e ' R) 



8=0 



< w x J(p x 



\e=o + w Y J(p { y' R) ; 



The claim is obtained by summing over all phase space directions R. 



□ 



The following convexity inequality for Fisher information is an immediate consequence of 
Theorem 16.21 obtained by setting wx = VX and wy = \/l — A. 

Corollary 6.3 (Convexity inequality for Fisher information). Let px,PY be n-mode states and 
< A < 1. Then 

J(£x(px ® Py)) < XJ(px) + (1 - X)J(py) . (85) 
We also obtain the following quantum analog of the classical Stam inequality (TTB|) . 



Corollary 6.4 (Quantum Stam inequality). Let px,PY be arbitrary states and consider the 
50 : 50 -beamsplitter map £ = £1/2 ■ Then 



> 



1 



1 



J(£(px ® Py)) ~ J(px) J(py) 
Proof. This follows from Theorem 16.21 by setting (A = 1/2) and 

J(Px)- 1 



w x 



J(px)- 1 + J(p Y ) 



and 



Wy 



J(P^ 



\-l 



J(px)- 1 + J(p Y ) 



□ 
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7 Quantum entropy power inequalities 

Having introduced the right quantum counterparts of the relevant classical concepts, the proof 
of the quantum entropy power inequalities is straightforward. We first provide a proof of the 



version (qEPT) for the transmissivity A-beamsplitter. It closely follows an argument given by 
Dembo, Cover and Thomas in [DCT91J for the classical statement (IcEPI'p . but relies on a slight 
adaptation that allows us to generalize it to the quantum setting. 

Theorem 7.1 (Quantum entropy power inequality for transmissivity A). Let < A < 1 and 

let £\ be the map (l4"5j) associated with a beamsplitter of transmissivity A. Let p x and p Y be 
arbitrary n-mode states. Then 

S(£x(px ® Py)) > \3(px) + (1 - ^)S(py) • (86) 

Proof. Define 

5(t) = S(e tc (£ x ( Px ®p Y ))) - XS(e tc ( Px )) - (1 - \)S(e tc (p Y )) . 

We want to show that 5(0) > 0. Since lim^oo 5(t) = by the asymptotic scaling of these 
entropies (Corollary 13. 411 . it suffices to show that 

—6{t) < for all t > . (87) 

Because {e tc } t is a semigroup, we get 

j t 5{t) = J(e tc (£ x (p x ® py))) - XJ(e tc ( Px )) - (1 - X)J(e tc (p Y )) , 

by the quantum de Bruijin identity (Theorem 15. ip . Because of the compatibility of E\ with e tc 
(Lemma 13. 2p . we can rewrite this as 

Aj f 6(t) = J(£ x (e tc ( Px ) ® e tc (p Y ))) - \J(e tc ( Px )) - (1 - X)J(e tC (p Y )) . 

Using the convexity inequality for Fisher information (Corollary 16. 31) . the inequality (157]) follows. 

□ 

Specializing Theorem 17.11 to Gaussian states, we obtain a concativity inequality for the 
entropy with respect to the covariance matrix. That is, denoting by p[y] the centered Gaussian 
state with covariance matrix 7, we have 

S(Ap[ 7 i] + (1 - A)p[ 72 ]) > AS(p[ 7 i]) + (1 - A)S(p[ 72 ]) for all < A < 1 . 



Finally, we can obtain the quantum analog ( qEPip of the classical entropy power inequal- 



ity ( IcEPip for the 50 : 50-beamsplitter, again mimicing a known classical proof. 

Theorem 7.2 (Entropy power inequality for 50:50 beamsplitter). Consider the map S = £1/2 
associated with a 50:50 beamsplitter. Let p x ,p Y be two n-mode states. Then 

e S(£(fix®PY))/n > \ e S{p x )/n _|_ _ e S(py)/n _ ^gg^ 

2 2 
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Proof. The proof follows Blachman [Bla65] , and is reproduced here for the reader's convenience. 
Define the functions 



F ^E X (F) 
G ^E Y (G) 
H ^E Z {H) 



exp (S(e FC (p x ))/n) 

exp (S(e GC ( PY ))/n) (90) 
exp(S(e HC (£(p x ®p Y )))/n) 



expressing the entropy powers of the states obtained by letting diffusion act on Px,Py and 
£{px ® Py) f° r times F, G and H > 0, respectively. According to Corollary 13.41 these functions 
have identical scaling 

E(T) ~ const. + Te/2 for T -»■ oo . (91) 

Moreover, since Ex and Ey are continuous functions on the positive real axis, Peano's 
existence theorem shows that the initial value problems 

Fit) = E x (F(t)), F(0) = 

G(t) = E Y {G{t)) , G(0) = 1 ; 

have (not necessarily unique) solutions F(-), G(-). Fix a pair of solutions and set 

H(t) = (F(t) + G(t))/2. (93) 

Since 

> efW*)- 1 ) > and G(t) > e^ G ^-^ > 



according to Theorem 13. 3[ these functions diverge, i.e., 

lim F(t) = lim G{t) = lim H(t) = oo . (94) 

t— >oo t— >oo t— >oo 

Consider the function 

_ Fx(F(t))+Fy(G(t)) 

W 2^(iJ(t)) 
With the initial conditions (19~2~)) . it follows that the claim (159|) is equivalent to 

5(0) > 1 . (95) 
Inequality ( 19 5 p follows from two claims: first, we have 

lim bit) = 1 , 

t— >oo 

as follows immediately from the asymptotic scaling (191 p of the entropy powers E, as well as ( I94p 
and the choice ( )93|) of H{t). Second, we claim that the derivative of 5 satisfies 

—6(t) > for all t > . (96) 

(J/L 
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Define, in analogy to ( 190]) . the Fisher informations of the states p x , Py and S(p x <8> Py) after 
diffusion for times F, G and H > as 



Jy(G) 
Jz(H) 



J (e FC ( P x)) 

J {e GC (p Y )) (97) 
J(e HC (£( Px ®p Y ))) . 



Using the fact that {e T£ }r is a semigroup, we can restate the quantum Bruijin identity (The- 
orem EHJ as 

E A (T) = ^-E A (T)J A (T) where A G {X, F, Z} (98) 

Note also that J X (F), Jy(G) and Jz{H) are related by Stam's inequality if H — (F + G)/2 is 
the average fl93|) of F and G: because 

e H£ (£(px ® py)) = £(e F£ (px) ® e G£ (py)) , 
by the compatibility of the beamsplitter with diffusion (Lemma 13.21) . we obtain 

MH) * J X (F) + M G) m 

using the quantum Stam inequality (Corollary 16. 4p and the nonnegativity of divergence-based 
Fisher information. Computing the derivative of 5, 

^ Ex(F(t))F(t) + E Y (G(t))G(t) E x (F(t)) + E Y (G(t)) . m( ,^ (A 
5it) = 2E z (H(t)) 2E z {H{t)f E ^)W) > 

inserting ( 198]) and suppressing the 5- dependence leads to 

8n6 = E X (F)J X (F)F + E Y {G)J Y {G)G - (E X (F) + E Y {G)) ■ E Z (H)J Z (H)H 
= E X (F) 2 .J X (F) + E Y (G) 2 J Y {G) - \{E X {F) + E Y (G)) 2 J Z (H) , 

where we inserted the definition ( 192]) of F and G. Replacing J Z (H) by the upper bound (l9"9l 
and further suppressing dependencies finally gives 

p2 t i T7i2 t /p , c J ^Jy 



> £^ J X + £ y Jy - (£ X + £y 



{E X J X — E Y Jy^ 2 



Jx + Jy 

y^y) 2 



Jx + Jy 

hence (]9"oT) follows from the nonnegativity of the Fisher information. □ 
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A The Q-function: Definition and Properties 

Define the Q-function in terms of the characteristic function by the expression 

Q(0 = J e-^ x (v)e- M2/ W n V . (100) 

Identity fllOOl) can be inverted to give 

x(v)=e M2/4 j e^Q(0d 2n ^. (101) 
Inserting the definition ( 1281) of the characteristic function into fllOOl) gives 

Comparing this with ( )32|) . we conclude that 

where |£) = D(£)|0) is a coherent state. 

This shows that Q(£) > for all £ G M 2n . Furthermore, since the coherent states satisfy 
the completeness relation 

1 r 

2n, 



(2tt)™ 



lexeme 



we have J <3(0<^ 2n £ = 1; i- e - ; Q is a probability density. 

The state p can be expressed in terms of Q using ( llOlj) and (12*71) : we have 

p = / Q(ri)a(r])d 2n ri where crfa) = — V / e^H'/V^Z^-O^ . (102) 
J (27r)" J 
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